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ABSTRACT 



A statistical technique is tested for adjusting the analog 
storm tracks used in the Northwest Pacific Analog Tropical 
Cyclone Forecasting Program (TYAN78) . In the present version 
the tracks are rotated based on linear extrapolation of the 
deviations of the past 12-h motion for the analog and the 
current storm. The technique proposed here uses multiple 
linear regression equations to adjust the 12-h through 72-h 
analog positions. Predictors include translation speeds 
along the analog track from the -36-h to 72-h positions plus 
the comparison of the past 12- , 24- and 36-h translated posi- 
tions of the analog with the current storm. By simply aver- 
aging the regression adjusted positions, the vector forecast 
errors were smaller than the TYAN78 errors. Both the TYAN78 
and regression adjusted forecast errors were less than the 
official JTWC forecast errors. The improvement in the regres- 
sion adjusted forecasts was in a smaller standard deviation 
relative to the mean position. For example, in the dependent 
sample the mean errors at 72 h for the official forecast, 

TYAN78 forecast and regression adjusted forecast were 438, 321, 
and 288 nautical miles, respectively. The corresponding 
standard deviations were 198, 151, 162 nautical miles. 
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I. 



INTRODUCTION 



Analog tropical cyclone prediction models have been in 
operational use since 1970 (Jarrell and Wagoner, 1973) . The 
basic principle of analog models is to use the mean position 
of all historical storms which have characteristics similar 
to the current storm. Probability ellipses are generated as 
part of the output forecast based on means and standard devia- 
tions of the latitudes and longitudes and the correlation co- 
efficient between them (Jarrell and Wagoner, 1973) . 

Fleet Weather Central/ Joint Typhoon Warning Center in Guam 
(FWC/JTWC) has used an analog model named TYFOON since 1970. 
TYFOON used a weighted mean latitude and longitude for the 
predicted 24-, 48-, and 72-h positions. A modified version of 
TYFOON named TYFOON-72 was instituted in 1972. The important 
modifications were improved input/output efficiency, derivation 
of adjustment coefficients and an increased momber of candidate 
analogs. The historical data base for TYFOON and TYFOON-72 
was the period 1949-1969. In 1975 a further modification of 
the analog model (TYFN-75) was introduced that incorporated a 
data base of storm tracks for the Northwest Pacific for the 
years 1945 to 1973. The most recent update and modification 
to the Analog Tropical Cyclone Forecast Program is named 
TYAN78. This version is the result of conversion from the 
Control Data Corporation (CDC) 3100 computer system to the 
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CDC 6500 system (Fleet Numerical Oceanography Center 6500/ 

Naval Environmental Display Station-1) . In this version, as 
in the earlier ones, the analog tracks are translated so that 
the best position of the analog is co-located with the current 
position. The remaining analog positions are adjusted accord- 
ingly and the track is then rotated based on linear extra- 
polation of the deviations of the past 12-h motion for the 
analog and the current storm. An updated data base is used in 
TYAN78 that includes storm tracks for the Northwest Pacific 
from 1945 through 1976. New weighting coef f iciencts are used 
based on these data. 

The purpose of this thesis was to derive multiple regression 
equations, based on analogs selected by TYAN78 for the North- 
west Pacific, to adjust the 12-h through 72-h analog positions. 
The objective of using regression-adjusted positions is to 
decrease the dispersion about the actual position. 

In addition to simply averaging the regression-adjusted 
positions, a weighting scheme based on standard deviations 
of the vector lengths found in comparing the 12- , 24-, and 
36-h past positions of the analogs to the current storm was 
tested. The results showed no improvement over the averaged 
regression-adjusted analog forecast for the 24- and 48-h fore- 
casts. There was a very slight improvement in the 72-h fore- 
cast that indicates a step toward possibly using recent track 
information to select the best analogs for use in TYAN78. 
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II. MODEL AND SAMPLE 



A. THE TYAN78 MODEL 

The model used to generate the data for this research is 
the TYAN78 model resident on the FNOC CDC 6500 computer. The 
TYAN78 program is a result of a conversion and combination of 
the five analog tropical cyclone models used by FWC/JTWC, 

Guam (ODSI, 1978). The program conversion was from the CDC 3100 
system to the CDC 6500 system. In addition to the conversion, 
five tropical cyclone analog models were combined to form 
TYAN78. The five were analog forecast models for the Northwest 
Pacific, Northeast Pacific, Southwest Pacific, Southwest 
Indian and North Indian Ocean regions. 

The TYAN78 program was designed to operate without computer 
operator intervention. The forecaster enters the required 
input information through Automatic Response to Query (ARQ) 
and receives a forecast for tropical cyclones in the afore- 
mentioned ocean basins. For the Northwest Pacific basin, the 
forecaster has the option of selecting only recurver analogs, 
only non-recurver analogs or a combination of all potential 
analogs. Separate TYAN78 forecasts are produced for each case. 
In the present work the combined sample was chosen. 

The TYAN78 program selects potential analogs as described 
by Jarrell and Wagoner (1973) , except that the time envelope 
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is +35 days rather than +50 days. Basically, the model searches 
the history file for all storms that occurred within the time 
envelope relative to the current (input) storm. Then the model 
checks for analog positions that occurred within +2.5 degrees 
latitude and +5.0 degrees longitude, as illustrated in figure 1. 
The time and space envelope will include more than one position 
along the storm track. A similarity index is calculated for 
each position within the envelope. The similarity index is 
defined as SI = Z W. • I P . -P . I, where W. is the weight factor 
associated with the ith parameter, P^^ is the value of the 
ith parameter of the current storm and P^^ is the value of 
the ith parameter of the analog position. The values were 

originally specified using "educated guesses" for each para- 
meter (Jarrell and Wagoner, 1973) . The parameters used are 
listed in Table I. The "best" position for each analog storm 
is defined as the one with the lowest similarity index (Jarrell 
and Wagoner, 1973) . The latitude and longitude correction 
necessary to translate the "best" position of the analog to 
the current position of the input storm is then calculated. 

This vector displacement is then added to all the points along 
the corresponding analog track. Thus the "best" positions of 
all analogs are initially co-located with the current storm 
position. The TYAN78 program then ranks the potential analog 
storms with the highest rank given to the storm with the 
lowest similarity index. This ranking is used in finding a 
weighted mean position for each set of forecasts. 
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Map of the dependent sample input "current" positions. 




TABLE 1 



Parameters for each 6-h position of the tropical storms 
and typhoons from which similarity indices are calculated 
(Jarrell and Wagoner, 1973) . 



Parameter 



Units 



Julian date 

Latitude 

Longitude 

12 h direction of movement 
12 h speed of movement 
18 h direction of movement 
18 h speed of movement 
24 h direction of movement 
24 h speed of movement 
48 h direction of movement 
48 h speed of movement 
Radius of outer closed isobar 
12 h change of radius 
Sea-level pressure 
(minimum) 

12 h change of min. SLP 
Maximum wind speed 
Minimum 700-mb height 
Latitude of 700-mb ridge 
to north 

Height of 700-mb ridge 
to north 

Longitude of 700-mb -trough 
to the west at 35N 
Height of 700-mb trough 
at 35N 



nearest tenth of degree 
nearest tenth of degree 
nearest degree 
nearest knot 
nearest degree 
nearest knot 
nearest degree 
nearest knot 
nearest degree 
nearest knot 
nearest degree 
nearest degree 

nearest millibar 
nearest millibar 
nearest 5 knots 
nearest decameter 

nearest degree 

nearest decameter 

nearest degree 

nearest decameter 
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In this research the TYAN78 model was used to the point 
of selecting and ranking the analogs. The normal TYAN78 fore- 
casts were also collected for later comparison with the new 
methods tested. 

The Program Maintenance Manual (ODSI, 1978) provided a 
means for having some of the coefficients and similarity in- 
dices printed out. A list of the top five analogs with the 
best position identified was also available as an option. 

(Note that this option is now part of the output to the fore- 
caster after a recent modification by ODSI to TYAN78) . The 
TYAN78 program was modified to list all analogs that were 
included in the search, and to print out the top 20 analogs 
including the "best" position for each storm. From this list 
of potential analogs, the positions for all analogs which 
included the -12-, -24-, -36-h positions and the +12-, 24-, 
36-,28-,60-,72-h positions were retained. 

B. SELECTION OF INPUT STORMS 

The data base for the Northwest Pacific includes storms 
from 1945 through 1976. It is thus possible that use of a 
storm during this period would result in its selection as a 
potential analog. A sample storm from 1968 (NINA 1200GMT 22 
November as initial position) was used in the ARQ input for- 
mat to generate the forecast and print out the top five analogs . 
The option of having all three forecast modes was entered in 
the ARQ. In the straight option, the input storm was selected 
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as an analog and was ranked number two rather than one , and 
the "best" position was the current (input) position. How- 
ever, in the option that considered all storms, the input 
storm was ranked nuinber five, and the "best?' position was 18 h 
after the input position. Because of these results, it was 
decided not to use any sample storms that occurred during the 
data base period (1945-1976). Thus, the dependent sample 
consists of storms from 1977 and 1978. 

From the 1977 and 1978 data, 21 storms were selected to 
give a representative geographic distribution (see figure 2) . 

For each of these storms, the name, date-time group, initial 
position and 12-h past position (all minimum ARQ requirements) 
were entered. Since all of the sample storms had at least a 
48-h history, the 18-, 24-, and 48-h past positions were also 
included. All position information was best track information 
from the Annual Typhoon Reports (1977 and 1978) . The maximum 
surface wind at the initial time was used with initial lati- 
tude to derive a minimum sea level pressure, and a 700-mb 
height from figure 9-15 in Atkinson (1971) . Other parameters 
such as the latitutde of the 700-mb ridge (see Table 1) were not 
available as input to the analog selection method. 

Using the above parameters as the TYAN78 ARQ input, 21 
storms were processed on the FNOC computer. Appendix A 
describes the procedures used for processing the data. The 
output thus included 21 forecasts for 24, 48, and 72 h. The 
listing of the 20 best analogs for each 
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storm included a total of 308 analogs with at least a 36-h 
history. Only 308 analogs were available because: 1) not all 

input storms had 20 potential analogs; and 2) not all of the 
potential analogs had the 36-h history required for appli- 
cation of the statistical method. This sample of 308 analogs 
and the associated best tracks of the 21 input storms was 
used to develop the multiple linear regression equations. 
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Ill . DEVELOPMENT OF MULTIPLE LINEAR REGRESSION EQUATIONS 



As described above, each of the potential analogs was trans- 
lated so that the "best" position was co-located with the 
initial position of the corresponding input storm. Consider 
an analog storm track and the best track of the input storm 
as in figure 3. The best track positions are considered known 
for the dependent sample used to develop the regression equa- 
tions, and represent the desired forecast positions after adjust- 
ment of the analog positions. Thus the predictands are the 
zonal and meridional displacements to be added to the analog 
positions. A total of 12 regression equations would normally 
comprise these sets. However, a provision is made for alter- 
native regression equations if the analog did not persist for 
72 h. Thus there is one set each for when the analog storm 
persisted 72, 60, 48, 36, or 24 h past the best position. 

The predictors used in this research are based on the work 
by Elsberry and Frill (1979) . They were able to improve the 
Tropical Cyclone Model (TCM) forecasts when the tracks were 
adjusted with regression equations that included past position 
information. A similar approach is used in this study, which 
required that only analogs with a 36-h history be retained in 
the sample. A schematic of the predictors, with line segments 
representing displacement vectors, is shown in figure 4. Table 
2 lists the predictors and predictands. These vectors were 
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converted to zonal and meridional velocity components.. Zonal 
and meridional velocity predictors were also computed for 
-12- , -24-, and -36-h positions of the analog compared to the 
input storm's corresponding past positions. Thus there are 
52 potential predictors. 

The regression equations were generated by the Stepwise 

Regression Program of the Biomedical Computer Programs package 

(Dixon and Brown, 1977) . The F-to-enter and F-to-remove values 

were specified as 4.0 and 3.9, respectively. These are the 

limits at which stepping is terminated. That is, no more 

variables are entered or removed if the calculated F-to-enter 

or F-to-remove fall below 4.0 or 3.9, respectively. In the 

final choice of equations, a maximum of five predictors was 

allowed, although no predictor was included if it contributed 

less than .01 to the increase in the multiple correlation co- 

2 

efficient squared (r ) . The equations are presented in Table 
3 , together with the multiple correlation coefficient squared, 
the F-ratio and degrees of freedom, and the standard error of 
estimate. The F-ratio used here is a test statistic defined 
as the ratio of the mean square due to regression to the mean 
square due to error. It is available from the analysis of 
variance table provided by the stepwise regression program. 

Under the null hypothesis this statistic has an F distribu- 
tion with numerator degrees of freedom equal to the number of 
variables selected thus far. The denominator degrees of free- 
dom equals the sample size minus the number of variables 
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Schematic of the predictands (AX^2/ ^^12' ^^ 24 ' 

etc.) which are the differences between best 
track and analog positions after translation. 
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4. Illustration of the intervals over which the pre- 
dictors were computed.- Positions are shown at 
12-h intervals. 
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TABLE 2 



Predictors/predictands used to develop regression equations 
for analog track adjustment 

1. Predictands: AX, AY (nautical miles) 

Times at which predictands are computed: 

12, 24, 36, 48, 60, 72 hrs 

2. Predictors: u, v (knots) 

Time intervals over which each predictor was computed: 
00-12, 00-24, 00-36, 00-48, 00-60, 00-72, 12-24, 
12-36, 24-36, 24-48, 36-48, 36-60, 48-60, 48-72, 
60-72 

00-M12, 00-M24, 00-M36, M12-M24, M12-M36, M12-12, 
M24-24, M24-M36 

where 00 = best (current) position 

M24 = 24 hours before current position 
12 = 12 hours from current position 
M12-B12, M24-B24, M36-B36 

where B24= 24 h past position of current storm. 
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selected minus one. Further details of the F distribution may 
be found in any statistics text. The null hypothesis that the 
set of coefficients for each equation is zero can be rejected 
for the F statistics and degrees of freedom shown. That is, 
the variables may be regarded as significant in predicting 
their respective predictands at a significance level of .01. 

One should note that predictors based on past positions were 
included in each of the equations. Note especially the occur- 
rence of predictors involving the -12-h and current (00) posi- 
tion of the analog. The predictors that compare the -12-h 
position of the analog with the -12-h position of the current 
storm are also chosen frequently. Thus the regression equa- 
tion approach developed here uses more than just the past 12-h 
motion to adjust the analog tracks. 

The table of equations only lists those regression sets 
for analog storms that existed either 72 h, 48 h, or 24 h past 
the time of the best position. Equations for intermediate 
times such as 60 h and 36 h were also developed and are evalu- 
ated below, but they are not listed here. The tables include 

2 

the multiple correlation coefficient squared (r ) , F-ratio 
with numerator and denominator degrees of freedom, and stan- 
dard error of estimate at the step the equation was developed. 

2 

For each of the equations the computed F-ratio and r are con- 
sistently less for the zonal displacement prediction than for 
the meridional displacement. This indicates that the linear 
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DYER24 = 69.323 - 1.789 (BYER12) + 

21.673 (VYM200) - 24.335 (VY0024) + .81 141.9 46. 

.412 (BYER36) - 11.117 (VYM600) (5,171) 
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Same as 3a except for analog storms that persisted to 24 h past "best" position. 
These are based on 21 storms with 308 analogs. 
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dependence of the meriodional displacements on the predictors 
is stronger than that of the zonal displacements. 

To make the regression-adjusted forecasts, the zonal and 
meriodional displacements calculated from the multiple linear 
regression equation were used to correct the analog storm's 24-, 
48- and 72-h positions. These adjusted positions were then 
averaged to provide a forecast position. Note that these fore- 
casts are based on 20 or fewer analog storms and that the ana- 
logs must have had at least a 36— h history before the "best" 
position. This is not the same number of analogs as TYAN78 
would evaluate, and these forecasts are not weighted as in 
TYAN78. 

A weighting scheme based on the standard deviations of the 
vector lengths between the current storm and analog storm -12-, 
-24-, and -36-h positions was also tested. The standard devia- 
tions were obtained from the sample of 308 analog storms. One 
weight was assigned to all corrections for zonal and meridional 
displacements. The equation is as follows: 



WGT = 



^-12 ^-24 ^-36 



(BX12^ + BY12^) ^ + 



(BX24^ + BY24^) 



2 2 ^ 
(BX36 + BY36 ) 



where the sample standard deviation of the vector dis- 

tance between analog storm and current storm -12-h positions. 

2 2 

Here (BX12 + BY12 ) is the calculated vector distance for the 

-12-h position comparisons for the individual analog storms. 
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IV. RESULTS 



A. DEPENDENT SAMPLE TEST 

The multiple linear regression equations were initially 
tested using the dependent sample. Included in the dependent 
sample of 21 storms were two that did not have official fore- 
cast error statistics. These two storms were deleted from the 
evaluation below. A list of storms used is presented in Table 7. 

Figure 5 illustrates an example of the analog storm positions 
before and after the regression adjustments compared to the 
best track of the input storm. The analog storm has been trans- 
lated so that its "best" position is co-located with the cur- 
rent position of the input storm. The regression adjustment 
overcorrects somewhat for this example, but it is expected that 
the adjustments for the other analog storms will give a 
distribution about the actual position. 

In the first test using dependent sample data, scatter dia- 
grams were prepared showing positions of analogs relative to 
the actual 24-, 28-, 72-h locations (see figures 6,7, and 8). 

This was done for positions before and after regression to in- 
dicate the advantage of the regression adjustment over simply 
translating the analog. There is a reduction in the dispersion 
of the regressed locations, especially at 48 h. This is also 
demonstrated in Table 4 , where the sample standard deviations 
of the predictands are shown to be reduced after regression. 
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DYRG72 DYER72 
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6. Scatter plot diagram of the predictands before 
regression (DXER72, DYER72) and. after regression 
adjustment of the analogs (DXRG72^ DYRG72). for the 
72-h positions. 
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DYRGU8 DTER148 
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7 . Same 




DXER48 




DXPG48 



as figure 6 except for the 48-h positions. 
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DYRG24 DYER2U 




DXER24 




DXRG214 



8. Same as figure 6 except for the 24-h positions. 
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TABLE 4 



Sample standard deviations of AX, AY before and after regres- 
sion adjustment for 24, 48, and 72 h (nautical miles). 



Predictand 

“24 

“24 

“48 

“48 

“72 

“72 



Standard 

Before Regression 
132.2 

108.8 

308.8 

230.9 

471.4 

351.9 



Deviation 

After Regression 

103.4 

98.9 

262.8 

209.9 

345.8 

311.5 
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In a second test, regression-adjusted forecasts and weighted 
regression-adjusted forecasts were made. These forecasts and 
the TYAN78 forecasts were compared to the corresponding best 
track positions and vector errors computed. The means and 
standard deviations of these errors are compared with those 
of the official forecasts in Table 5. The TYAN78 vector fore- 
cast errors at all time intervals are smaller than the official 
forecast errors for this sample. The regression-adjusted vector 
forecast errors are in turn smaller than those for TYAN78. The 
standard deviation for the regression-adjusted forecast errors 
is likewise smaller except at 72 h. Though this standard devia- 
tion for the regression-adjusted forecast error is larger than 
that for TYAN78, the 72-h position is better with respect to 
the mean position. 

B. INDEPENDENT SAMPLE TEST 

The ultimate test of the multiple linear regression equations 
is the comparison of the forecast errors for an independent 
data set. For the independent test a sample from 1979 was 
used. As in the selection of the dependent set, randomness 
was desired, but was not guaranteed. The same selection criteria 
was followed for the independent sample as for the dependent 
sample. The input storm was required to have at least a 108-h 
history and official forecast error statistics for 24, 48, and 
72 h from the input initial position. A total of 18 storms 
were selected as the independent sample. Three were dropped 
from the original 18 . One was dropped because the ARQ was 
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TABLE 5 



Vector length error statistics (nautical miles) comparing 
official forecast errors, TYAN78 errors and regression 
adjusted forecast errors for the dependent sample. 



24 h FORECAST 

OFFICIAL 

TYAN78 

AVERAGED REGRESSION 
WEIGHTED REGRESSION 



MEAN (NM) 


STANDARD 

DEVIATION 




(NM) 



119.8 


70.3 


77.7 


53.8 


68.1 


42.8 


68.8 


43.8 



48 h FORECAST 






OFFICIAL 


238.8 


110.3 


TYAN78 


160.2 


88.5 


AVERAGED REGRESSION 


147.0 


83.3 


WEIGHTED REGRESSION 


148.9 


82.9 



72 h FORECAST 






OFFICIAL 


438.1 


198.3 


TYAN78 


320.8 


151.4 


AVERAGED REGRESSION 


288.0 


162.1 


WEIGHTED REGRESSION 


287.3 


161.0 
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rejected. Another was dropped because none of the top 20 
analogs existed for 72 h past the "best” position. The third 
was dropped because only one analog was chosen and it did not 
exist for 72 h past the "best" position. 

The final set of independent storms and their potential 
analogs was 15 storms with 199 analog storms (see figure 9 
and list in Table 7) . These storms and the analog storms were 
processed as in the dependent sample test. The resulting vector 
error statistics are compared in Table 6. 

The means and standard deviations for the regression-adjusted 
forecast errors, as well as the TYAN78 forecast errors, were 
less than those for the official forecast errors for all time 
intervals. Although the mean errors for the regression-adjusted 
forecasts and TYAN78 forecasts were equivalent at 24 h, the 
stnadard deviation was reduced in the regression-adjusted fore- 
casts. For the 48- and 72-h regression-adjusted forecasts, the 
standard deviations of the errors are greater than those for 
the TYAN78 forecasts. However, the 48- and 72-h regression- 
adjusted positions are better relative to the mean position. 

It should be noted that the TYAN78 uses a larger sample 
of analog storms and has a weighting function based on the 
similarity index. Thus a direct comparison of the standard 
deviations is not possible. To get such a comparison, the 
present rotation scheme in TYAN78 would have to be replaced 
by the regression-adjustment scheme and then proceed with the 
remainder of the TYAN78 forecast weighting scheme. This 
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Map of the independent sample input "current 
positions. 



TABLE 6 



Vector length error statistics (as in Table 5) except for 
the independent sample. 





MEAN (NM) 


STANDARD 

DEVIATION 

(NM) 


24 h FORECAST 


OFFICIAL 


116.6 


58.6 


TYAN78 


83.2 


48.6 


AVERAGED REGRESSION 


84.9 


42.9 


WEIGHTED REGRESSION 


84.3 


42.4 



48 h FORECAST 






OFFICIAL 


205.7 


141.7 


TYAN78 


190.9 


68.3 


AVERAGED REGRESSION 


155.7 


93.2 


WEIGHTED REGRESSION 


156.4 


93.9 



72 h FORECAST 






OFFICIAL 


341.7 


278.4 


TYAN78 


337.6 


158.9 


AVERAGED REGRESSION 


307.7 


172.2 


V7EIGHTED REGRESSION 


312.5 


169.7 
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TABLE 7 



List of dependent and independent sample storms used in the 
research. 



DEPENDENT SAMPLE 



NAME 


DATE-TIME-GROUP 
(YYMMDDHH GMT) 


INITIAL 

(DEGREES 


POSITION 

LAT/LON) 


MAXIMUM 

SURFACE 

WIND 

(KTS) 


MINIMUM 

SLP 

(MB) 


SARAH 


77071812 


15. 9N 


118. 3E 


50 


986 


THELMA 


77072300 


17.5 


123.8 


70 


971 


DINAH 


77091718 


17.3 


116.3 


55 


981 


GILDA 


77100606 


23.8 


149.2 


60 


975 


IVY 


77102318 


20.4 


149.0 


65 


971 


KIM 


77111018 


14.8 


132.2 


125 


923 


LUCY 


77120106 


8.4 


140.3 


35 


998 


OLIVE 


78042206 


14.3 


115.6 


60 


979 


TRIX 


78071400 


23.6 


143.3 


35 


993 


VIRGINIA 


78072400 


16.6 


151.0 


45 


990 


WENDY 


78072612 


21.9 


133.1 


70 


969 


CARMEN 


78081218 


18.8 


142.4 


55 


982 


* ELAINE 


78082218 


17.9 


124.6 


25 


1005 


* IRMA 


78091118 


24.1 


122.7 


30 


996 


JUDY 


78091300 


24.6 


14 6.7 


40 


990 


LOLA 


78092912 


17.0 


113.5 


70 


971 


MAMIE 


78100106 


22.4 


152.3 


45 


989 


ORA 


78101112 


20.9 


127.4 


55 


980 


PHYLLIS 


78101812 


20.9 


155.3 


90 


950 


RITA 


78102200 


11.2 


153.5 


115 


946 


VIOLA 


78111918 


13.5 


137.3 


60 


981 



*Used in developing regression equations but not in testing be- 
cause it had no official forecast error data. 



INDEPENDENT SAMPLE 



BESS 


79032012 


10. 8N 


139. 9E 


36 


997 


CECIL 


79041406 


8.4 


130.4 


63 


983 


DOT 


79051118 


10.2 


122.3 


26 


1000 


ELLIS 


79070100 


13.4 


132.0 


35 


996 


FAYE 


79070400 


10.3 


139.9 


46 


991 


HOPE 


79072806 


17.0 


137.4 


35 


996 


IRVING 


79081406 


24.0 


124.9 


80 


956 


JUDY 


79081818 


17.5 


136.6 


107 


939 


MAC 


79091800 


13.9 


123.8 


50 


988 


OWEN 


79092500 


20.9 


131.0 


72 


965 


SARAH 


79100812 


11.0 


119.0 


58 


984 


TIP 


79100900 


12.0 


147.7 


52 


988 


VERA 


79110400 


10.5 


133.1 


97 


956 


WAYNE 


79110918 


16.9 


129.3 


43 


989 


ABBY 


79121006 


11.8 


133.1 


64 


979 
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would insure that the same number of analog storms are con- 
sidered, and the similarity index weighting would be applied 
to both sets. 

This limited comparison of the regression-adjusted fore- 
cast vector error standard deviations with those of the official 
forecast does seem to represent an increase in skill of the 
regression adjustment scheme over the official forecast. 
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V. CONCLUSIONS 



The objective of this research was to adjust statistically 
the analogs selected by TYAN78 and produce improved forecast 
vector errors. Frill (1979) showed improvement in forecasts 
from the TCM with equations containing predictors based on 
backward integration of the TCM. A similar approach was used 
here in that the majority of the predictors selected in the 
stepwise regression program involved past position and motion 
information. 

Multiple linear regression equations derived here reduce 
the mean and variance of the vector errors of tropical cyclone 
track forecasts. The inclusion of past position predictors 
in all of the equations indicates the role that persistence 
has in the predictions. However, the difference in the .past 
12 -h motion of the storm and the analog storm is not simply 
extrapolated in time, as is the case in the present TYAN78 
scheme . 

An earlier version of the regression equations was based 
on only 11 input storms with only 161 analogs. Many of the 
same predictors were selected in that sample, but the co- 
efficients were different. This set did not produce stable 
regression coefficients, because an independent sample resulted 
in degraded adjustments to the analog tracks. 
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If the TYAN78 model had not selected the input storm as an 
analog of itself, the sample size could have been increased. 

Thus, more stable regression equations could have been 
developed. 

As described before, the regression equations were developed 
using analogs selected under the ARQ option for all storms. 

It is suggested that multiple linear regression equations be 
developed for each of the other options (recurver and non- 
recurver) . Another possibility of improvement in the regression 
equations would be to use the earlier forecast position as a 
predictor for following positions. That is, one could explore 
the use of regression equations with the 24-h forecast position 
or related velocity components as a predictor for the 48- and 
72-h forecasts. Similarily, the 72-h forecast would have avail- 
able information regardingthe 24- and 48-h forecast position. 

This would assure consistency in the adjustments at the various 
intervals . 
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APPENDIX A 



METHODS : 

The following section describes the methods used in obtaining 
and processing the data for this research. 

Program NWPAC in TYAN78 was modified to allow the model to 
select and print out the top 20 analog storms. Subroutine 
NEXRPT was altered to force the TYAN78 program to print out 
all potential analog storms read during the historical search. 

The output from the computer jobs at FNOC was manually 
searched for the top 20 analog storms. These were separated 
from the listing, marked with the rank and the "best" position 
identified. All analog storms listed were actual best tracks 
with positions given every six hours. 

Most of the data was entered as card images on the W.R. 
Church computer center time-sharing system, then later punched 
onto cards. Once the raw data were on cards a track adjust- 
ment program was run to translate the analog storms. The 
track adjustment program produced punched cards with the trans- 
lated positions. 

The translated data were used in a predictor/predictand 
computing program which stored the computed values on disk. 

The BMDP2R program [Dixon and Brown, 1977] was run 42 times 
using the data from disk. The resulting equations are for 
intermediate times as well as the times described in the main 
text. All the equations were inserted in the evaluation program 
with resulting output described in the main text. 
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